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Abstract 

Background: An in silico analysis of the mitochondrial protein import apparatus from a variety of species; including 
Chlamydomonas reinhardtii, Chlorella variabilis, Ectocarpus siliculosus, Cyanidioschyzon merolae, Physcomitrella patens, 
Selaginella moellendorffii, Picea glauca, Oryza sativa and Arabidopsis thaliana was undertaken to determine if 
components differed within and between plant and non-plant species. 

Results: The channel forming subunits of the outer membrane components Tom40 and Sam50 are conserved 
between plant groups and other eukaryotes. In contrast, the receptor component(s) in green plants, particularly 
Tom20, (C. reinhardtii, C. variabilis, P. patens, S. moellendorffii, P. glauca, O. sativa and A. thaliana) are specific to this 
lineage. Red algae contain a Tom22 receptor that is orthologous to yeast Tom22. Furthermore, plant mitochondrial 
receptors display differences between various plant lineages. These are evidenced by distinctive motifs in all plant 
Metaxins, which are absent in red algae, and the presence of the outer membrane receptor OM64 in Angiosperms 
(rice and Arabidopsis), but not in lycophytes (S. moellendorffii) and gymnosperms (P. glauca). Furthermore, although 
the intermembrane space receptor Mia40 is conserved across a wide phylogenetic range, its function differs 
between lineages. In all plant lineages, Tim17 contains a C-terminal extension, which may act as a receptor 
component for the import of nucleic acids into plant mitochondria. 

Conclusions: It is proposed that the observed functional divergences are due to the selective pressure to sort 
proteins between mitochondria and chloroplasts, resulting in differences in protein receptor components between 
plant groups and other organisms. Additionally, diversity of receptor components is observed within the plant 
kingdom. Even when receptor components are orthologous across plant and non-plant species, it appears that the 
functions of these have expanded or diverged in a lineage specific manner. 



Background 

The endosymbiotic event giving rise to the origin of 
mitochondria is thought to have occurred 1 to 2 billion 
years ago [1,2]. Details of the conditions that favoured 
this event and the exact identity of the host cell that 
engulfed the a-proteobacterial cell are still unclear. It 
has been proposed that the endosymbiosis that gave rise 
to mitochondria occurred under anaerobic conditions, 
followed by early diversification of eukaryotic cells [3]. 
For plastids, an endosymbiotic event occurred ~1 billion 
years ago when a heterocyst forming cyanobacterium 
was engulfed [4,5]. Over time the loss and/or transfer of 
genes and genomes from the endosymbionts to the host 
cell nucleus has resulted in the formation of organelles 
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with limited coding capacity [6-8]. The majority of pro- 
teins located in mitochondria and plastids are encoded 
by nuclear located genes, translated in the cytosol and 
imported into these organelles. Notably, the proteomes 
of both mitochondria and chloroplasts are derived from 
a variety of sources and are not simply a subset of the 
proteins derived from the ancestral endosymbiont [9]. In 
the most extreme cases, it is thought that all genes that 
were present in the endosymbiont have been lost, result- 
ing in specialized organelles such as hydrogenosomes 
and mitosomes [10]. 

Although mitochondria have a single origin there is 
variation observed between different mitochondria pre- 
sent in the major branches of life [10]. Mitochondria in 
plants contain many unique features compared to their 
fungal or animal counterparts. These include a larger 
genome, ranging from 200 Kb to 2000 Kb in size [11], 
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extensive cis and trans splicing of introns, [12], relatively 
slow rates of mutations [13,14], extensive editing of 
mRNA [15] and incorporation of foreign DNA [16]. 
Another notable feature is the presence of a branched 
respiratory chain [17]. Although fungi contain alterna- 
tive NAD(P)H dehydrogenases and an alternative oxi- 
dase, these are usually only expressed under conditions 
where the cytochrome chain is inhibited [17]. In con- 
trast, plant mitochondria contain components of the 
alternative respiratory pathways which exhibit both con- 
stitutive and stress induced expression [18]. Further- 
more, mitochondria of plants and animals have 
diversified in a lineage specific manner to include or 
exclude various biochemical pathways, such as the 
P -oxidation of fatty acids that occurs in peroxisomes in 
plants and mitochondria in animals [19]. 

In plants, the presence of plastids in cells also adds to 
the complexity of protein sorting required to avoid 
mis-targeting of proteins to organelles. Plastidic and 
mitochondrial targeting signals, referred to as transit 
peptides and presequences respectively, are typically 
located at the N-terminal end of the protein and are 
enriched in positively charged residues such as lysine 
and arginine [20]. It is not known how mis-sorting of 
proteins is prevented between plastids and mitochon- 
dria. A combination of the predicted ability of transit 
peptides and presequences to form different secondary 
structures, the proposed presence of cytosolic targeting 
factors and even targeting of mRNA to the surface of 
organelles, may all combine to achieve the observed spe- 
cificity of protein targeting [21,22]. There is a mechanis- 
tic difference between recognition of targeting signals by 
preprotein receptor proteins in plastids and mitochon- 
dria, the former involving a GTP/GDP cycle while no 
energy requirement is observed for receptor binding in 
mitochondria. This mechanism among others may con- 
tribute to the specificity of targeting signal recognition 
at the surface of each organelle [23,24]. 

Our knowledge of the mitochondrial protein import 
apparatus in plants, both experimental and predicted, is 
largely derived from studies in Arabidopsis, and to a les- 
ser extent from Solatium tuberosum (potato). Purifica- 
tion of the translocase of the outer membrane (TOM) 
complex from both Arabidopsis and potato revealed 
that Tom40 and Tom7 are orthologous with those from 
yeast, while Tom20 is not orthologous to yeast or mam- 
malian proteins [25-27]. The other import receptors 
characterized in yeast (and mammals), Tom70 and 
Tom22, appear to be absent [28,29]. It has been shown 
that plant Tom9 is the most likely equivalent to yeast 
Tom22, but lacks the cytosolic receptor domain [30]. 
The mitochondrial processing peptidase has been puri- 
fied from potato and shown to be integrated into the 
cytochrome bcj complex [31,32]. This is also the case in 



lower plants examined both in the elkhorn fern Platy- 
cerium bifurcatum and the field horsetail Equisetum 
arvense [33]. Biochemical purification of the prese- 
quence degradation peptidase (PreP) has shown that it 
is a dual targeted protein and that it is a zinc metallo- 
protease [34]. Biochemical studies have shown that 
small intermembrane space proteins also mediate mito- 
chondrial carrier protein import in potato mitochondria. 
In addition, the plant TIM17:23 complex differs to that 
in yeast in that the Timl7 in Arabidopsis contains a C- 
terminal extension that must be removed before it can 
complement a timl7 mutant in yeast [35,36]. 

However, there are limited studies on the nature of 
the mitochondrial protein import apparatus from other 
plants, ranging from single celled algae to monocots. 
Thus, in order to gain a better overview of the protein 
import apparatus in plants, compared to fungal and ani- 
mal counterparts, an in silico analysis of these compo- 
nents was carried out. This was based on the fact that 
complete genome sequences now exist for the single 
celled green algae, Chlamydomonas reinhardtii (Chloro- 
phyte) [37] and Chlorella variabilis, an intracellular sin- 
gle celled green algae photosynthetic symbiont in 
Paramecium bursaria [38], a moss, Physcomitrella 
patens (Bryophyte) [39], Selaginella moellendorffii, an 
ancient vascular plant [40], and higher plants Oryza 
sativa [41], Arabidopsis thaliana [42] and Picea glauca 
[43] (Spermatophytes) (Figure 1). We have also included 
analysis from brown algae, Ectocarpus siliculosus 
(Phaeophyceae) [44] and the red algae, Cyanidioschyzon 
merolae (Rhodophyta) [45] (Figure 1). Red algae repre- 
sent a cell lineage with a primary plastid endosymbiosis 
that is proposed to have been derived from the same 
event that gave rise to the plastids in green plants, but 
diverged from the green plant lineage early after this 
endosymbiotic event [46]. Brown algae have obtained 
their plastids via a secondary endosymbiosis, and con- 
tain four plastid envelope membranes. Thus, plastid pro- 
teins are first targeted to the outer membrane via a 
hydrophobic signal sequence and secondary targeting 
signals mediate uptake into plastids [47]. 

Results and Discussion 

Translocase of the Outer Membrane (TOM) 

The TOM complex represents the gateway into mito- 
chondria, through which almost all mitochondrial pro- 
teins pass (exceptions include Fisl [48]). It has been 
characterized from yeast, Neurospora, mammals and 
plants, in particular Arabidopsis. In addition to being 
purified from Arabidopsis, functional studies on the 
Tom20 receptor components show that all three iso- 
forms can be deleted, resulting in a reduced rate of 
import for several precursor proteins, but no deleterious 
phenotypic lesions. The complex typically contains 7 
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Figure 1 Overview of the evolutionary relationship of organisms used in this study The taxonomy database at NCBI was used to draw a 
phylogenetic tree, which was visualized using PHY-PI [91], The timeline is based upon [57]. 



subunits, Tom70, 40, 22, 20 7, 6, and 5, but in Arabi- 
dopsis Tom22 is replaced by Tom9 and no orthologue 
to Tom70 can be identified. Arabidopsis contain a pro- 
tein termed OM64 that is not present in yeast or mam- 
mals, which appears to play a role as an import 
receptor. 

The TOM complex fulfills the vital function of specifi- 
cally recognizing mitochondrial proteins from the pool 
of all proteins synthesized in the cytosol. Of the seven 
components characterized biochemically to be present 
in the TOM complex from yeast, only Tom40 is con- 
served between yeast, mammals and plants (Figure 2, 
Additional file 1). While Tom40 is a |3-barrel protein, 
there is no significant sequence similarity with bacterial 
P-barrel proteins [30], nonetheless, hidden Markov 
model searches define this as a universal component of 
all mitochondria, including mitosomes in Entamoeba 
histolytica and Giardia intestinalis [49,50]. 

Tom22 has been shown to fulfill a central receptor 
role in yeast, and insertional inactivation in yeast results 
in a strong impairment of mitochondrial biogenesis, 
compared to the other two preprotein receptors charac- 
terized, Tom20 and Tom70 [51]. Searching the genome 
of the red algae C. merolae for Tom22-like proteins 
identified a protein with a predicted molecular mass of 
20 kDa. This protein displays sequence identity and a 
similar domain structure to the yeast Tom22 (Figure 2, 
Figure 3b). Thus, the TOM complex of C. merolae 
appears to be similar to that of D. discoideum, in that it 
contains a single receptor Tom22-like protein [49]. In 
contrast, green plants and E. siliculosus do not contain a 



Tom22 protein. Rather, they contain a Tom9 protein 
domain component (Figure 2, Additional file 1). Plant 
Tom9 is predicted to be structurally similar to yeast and 
mammalian Tom22, except that it lacks the cytosolic 
receptor domain [30]. Thus, Tom22 has either lost the 
receptor domain to form Tom9 or been replaced by a 
different protein. Irrespective of the mechanisms by 
which Tom9 arose, it appears that green plants have 
lost the Tom22 receptor. The presence of the Tom22 
receptor component in C. merolae and D. discoideum 
suggests that it represented a universal mitochondrial 
receptor component prior to the divergence event that 
gave rise to plants verse animals and fungi. 

None of the receptor proteins characterized in yeast 
or mammalian systems, Tom20, Tom70 and Tom22, are 
present in green plants [52,53] (Figure 2). The evolu- 
tionary situation for Tom22 is outlined above, and 
although a Tom20 receptor protein is present in plants, 
it represents a case of convergent evolution that has 
been previously well described [27,54,55], thus, plant 
and yeast Tom20 proteins are not orthologous (Figure 2, 
Additional file 1). The third receptor component, 
Tom70, is only present in animals and fungi [45] (Fig- 
ure 1). Tom70 is not present in any green plant genome 
[29], a variety of searches in this study failed to detect 
any Tom70 like sequences in the green plant genomes 
interrogated. However, in the genome of the brown 
algae E. siliculosus, a protein with a similar domain 
structure to Tom70 was identified (Figure 3C). This 
Tom70 like protein contains an N-terminal transmem- 
brane domain and 11 Tetratricopeptide repeat (TPR) 
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Figure 2 Diversity of the TOM complex in plants. A) Schematic diagrams of the TOM complex from a selection of plant species using the 
TOM complex from yeast as a reference. B) A table depicting components of the TOM complex in a variety of organisms. The pink color refers 
to proteins that are conserved across all organisms and likely have a common ancestor. The lime green colored proteins are specific to the plant 
ineage. The pale green proteins are proteins that have an unknown origin. Yeast - Saccharomyces cerevisiae, Ectocarpus - Ectocarpus siliculosus 
(Es), Cyanidioschyzon - Cyanidioschyzon merolae (Cm), Chlorella - Chlorella variabilis (Cv), Picea - Picea giauca (Pg), Selaginella - Selaginella 
moellendorffii (Sm), Chlamydomonas - Chlamydomonas reinhardtii (Cr), Physcomitrella - Physcomitrella patens (Pp), Rice - Oryza sativa (Os), 

Arabidopsis - Arabidopsis thaliana (At), Human - Homo sapiens, Poplar -Populus tricocarpa (Pt), Glycine - Glycine max (Gm) Zea - Zea mays (Zm). 
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Figure 3 Analysis of components of the plant TOM complex. A) Phylogenetic tree of the chloroplastic Toc64 and mitochondrial OM64 
sequences from plants. Only sequences which showed the characteristic domain structure of Toc64, an N-terminal transmembrane domain 
followed by an amidase like domain with 3 TPR repeats at the C-terminus, were used for phylogenetic analyses. No such proteins were 
identified in Ectocarpus siliculosus, Cyanidioschyzon merolae, Chlamydomonas reinhardtii and Chlorella variabilis. For most plant species a clear 
Toc64 and mtOM64 homologue can be identified however only 1 sequence can be identified in Physcomitrella patens, Selaginella moellendorffii 
and Picea glauca which all branch closest to the Toc64 chloroplastic proteins. B) Sequence alignment of the Saccharomyces cerevisiae Tom22 
with the Arabidopsis thaliana Tom9 and Cyanidioschyzon merloae Tom22. While most plants contain only the Arabidopsis Tom9 like protein 
Cyanidioscyzon contains a full Tom22 receptor, which shows a high similarity with the yeast Tom22. C) The domain organization of the yeast 
Tom70, Arabidopsis mtOM64 and an EsTom70 like protein. TM - transmembrane domain and the numbers correspond to the TPR repeats. At - 
Arabidopsis thaliana, Vv - Vitis vinifera, Gm - Glycine max, Pt - Populus tricocarpa, Os - Oryza sativa, Pp - Physcomitrella patens, Zm - lea mays, Sc - 
Saccharomyces cerevisiae, Cm - Cyanidioschyzon merolae, Es - Ectocarpius siliculosus, Sm - Selaginella moellendorffii and Pg - Picea glauca. 
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motifs similar to yeast Tom70 [29]. However, the level 
of sequence identity is low (20%), and it is unclear if 
this protein represents a Tom70 orthologue. 

A protein with a predicted molecular mass of 64 kDa 
(OM64) is found on the outer membrane of mitochon- 
dria in Arabidopsis, displaying -70% sequence identity 
with the Toc64 protein (translocase of the outer envel- 
ope of chloroplasts) from plastids [56] (Figure 3A). In 
plant mitochondria, this protein has been shown to be 
involved in the import of some precursor proteins [53]. 
Analysis of various plant genomes reveals that OM64 
appears to be present only in a sub-set of vascular 
plants and is absent in P. glauca and S. moellendorffii, 
as well as lower plant groups represented by C. rein- 
hardtii, C. variabilis, P. patens, E. siliculosus and 
C. merolae (Figure 2A). Tom7 represents an interesting 
case in that it is absent in C. merolae, but present in all 
other plants and eukaryotes (Figure 2). TBlastx and hid- 
den Markov model based searches of all red algae gen- 
omes available failed to find this component [50]. Thus 
even if it was not annotated in the genome sequence of 
C. merolae these searches should detect its presence. 
However, it cannot be ruled out that it may have been 
missed in the sequencing and/or assembly of the 
C. merolae genome. Tom5 and Tom6, proteins of 
approximately 50 amino acids long, were not detected 
in C. variabilis, using either plant or yeast interrogation 
sequences in searches. However, the small size of Tom5 
and 6 proteins means that it is difficult to define their 
evolutionary relationship across wide phylogenetic gaps. 
Tom7, on the other hand appears to be orthologous 
across all groups, with the exception that it cannot be 
found in C. merolae. Thus suggesting that the small 
TOM proteins may be lineage specific, as is the case of 
the Tom20 receptor. 

It is evident that the TOM complex of plants displays 
diversity with respect to the receptor components pre- 
sent. While Tom40 is universally present, the presence 
of Tom20 is only evident in green plants, Tom70 is only 
present in E. siliculosus, and OM64 appears to have 
arisen by a relatively recent evolutionary event as it is 
only present in a variety of monocot and dicot plants 
examined and could not be detected in P. glauca and 
S. moellendorffii (Figure 2A and 3A). The brown algae 
E. siliculosus, contains a Tom70 type receptor. As there 
is no Tom70 like sequences in green plants [29], the 
Tom70 type receptor was either derived from the speci- 
fic host in the symbiosis that led to the formation of 
brown algae, or alternatively, it may represent a case of 
convergent evolution, as has been observed between 
green plants and Opisthokonts for the Tom20 receptor 
[27,55]. 

An analysis of the mitochondrial protein import appa- 
ratus in a variety of plants reveals that C. merolae 



clearly contains a Tom22 type receptor in contrast to all 
other plant lineages. Thus, this component may either 
have been lost from brown algae and green plant 
lineages or the presence of a Tom22 type receptor in 

C. merolae represents another case of convergent evolu- 
tion. As brown algae are proposed to have been derived 
from red algae, after the latter branched from green 
plants [57], the Tom22 receptor would have to be lost 
independently in green plants and brown algae. How- 
ever, caution needs to be exercised, as the sequence of 
E. siliculosus may not be fully representative of all 
brown algae. 

The question of how red algae solve the sorting pro- 
blem between plastids and mitochondria may relate to 
the binding substrates of the receptors, that is, the tar- 
geting signals. Analysis of plastid targeting signals from 
all plant lineages reveals that red algae (and the other 
primary plant lineage, glaucophytes) contain a phenyla- 
lanine residue within a few amino acids of the 
N-terminus, which is in a hydrophobic context [58]. 
This 'ancestral' plastid targeting motif is not present in 
plastid targeting signals in green plants [58], and thus 
the differentiation of plastid and mitochondrial targeting 
signals in green plants differs to red algae. In red algae, 
the "phenylalanine containing" transit peptide may serve 
as a means for mitochondria and plastid targeting sig- 
nals to be recognized or rejected by plastidic or mito- 
chondrial receptors respectively. 

Sorting and Assembly Machinery of the Outer 
Mitochondrial Membrane (SAM) 

The SAM complex is required for the insertion of 
P -barrel and a -helical proteins into the outer membrane 
[52]. The insertion of P -barrel proteins into the mito- 
chondrial outer membrane is conserved from bacteria to 
mitochondria and plastids, where Omp85, Sam50 and 
Toc75 are orthologous P -barrel proteins that are essen- 
tial for this process [59,60]. However, apart from this 
central component, there are no other conserved com- 
ponents identified for the insertion of P -barrel proteins 
into membranes from bacteria to mitochondria and 
plastids (Figure 4A and 4B). In yeast, four additional 
components are involved; Sam35, Sam37, MdmlO and 
Miml, with Sam35 representing an essential component. 
As the SAM complex has not been biochemically char- 
acterised from mammalian or plant systems, any addi- 
tional components are unknown in these systems. The 
genome of D. discoideum has a gene encoding Sam50, 
but lacks the other components identified in yeast. As 

D. discoideum is an amoeba that diverged from Opistho- 
konts after this lineage had split from plants, this sug- 
gests that the additional components observed in yeast 
arose after the lineage divergence of plants from other 
groups. Although additional components are likely to be 
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Figure 4 The SAM complex of plants. A) Schematic representation of the SAM complexes found in yeast and plants. B) Table indicating the 
presence or absence of a component of the SAM complexes found in plants. C) Representation of the different domains of yeast Sam37, 
Human Metaxin and Arabidopsis metaxin proteins. All three share similar Glutathione S-transferase (GST) domains. However the location of the 
transmembrane domains (TM) differs. A motif analysis search of plant metaxin sequences identified Motif 1 to regions between amino acids 37 
and 79 and motif 2 between amino acids 1 04 and 1 55 in Arabidopsis. These motifs appear only in plant like Metaxins and near a region 
required for protein binding. Colors and abbreviations are the same as Figure 2. 
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present in the SAM complexes from plants, they are 
unlikely to be orthologous to the components in yeast. 

In Arabidopsis, a protein called Metaxin has been 
shown to be involved in the import of P -barrel proteins 
into the outer membrane. The deletion of Metaxin is 
not lethal in Arabidopsis, although plants are sterile and 
grow poorly [53] . Deletion of Metaxin results in a large 
up-regulation of transcript abundance for genes encod- 
ing the mitochondrial P-barrel proteins porin and 
Tom40, with an accumulation of porin observed in the 
cytosol, indicating that Metaxin plays a role in the inser- 
tion of P-barrel proteins in plants. Mammalian genomes 
contain two genes encoding Metaxin, in fact the plant 
Metaxin protein was identified using blast searches of 
the mammalian Metaxin protein [53] (Figure 4C). Mam- 
malian Metaxin has also been shown to be involved in 
the import of P-barrel proteins into the outer membrane 
of mitochondria [61]. Mammalian Metaxin does identify 
with Sam37 in a blast search, although the sequence 
identity is very low (Additional file 1). A number of fea- 
tures distinguish plant and animal Metaxins from 
Sam37 in yeast. Firstly, human and Arabidopsis Metax- 
ins are anchored to the outer membrane in the opposite 
orientation compared to Sam37 (Figure 4C). Yeast 
Sam37 is anchored to the mitochondrial outer mem- 
brane by an N-terminal transmembrane domain, 
whereas human and Arabidopsis Metaxins contain 
C-terminal transmembrane domains. Secondly, human 
and Arabidopsis Metaxins contain conserved glutathione 
S-transferase (GST) domains (Figure 4C). Plant and ani- 
mal Metaxins are distinguished by the fact that Metaxin 
is not found in a complex with human Sam50 [61], 
whereas plant Metaxin is in a complex with plant 
Sam50 (Duncan and Whelan - unpublished data). In 
Arabidopsis Metaxin there are two conserved motifs in 
a region critical for binding that are only found in plant 
type Metaxins (Figure 4C motif 1 and 2). It is also of 
interest to note that while Trypanosomes do contain a 
Metaxin like protein [62], there are no Metaxin or 
Sam37 like proteins identified in D. discoideum [49] or 
C. merolae in this study. The presence of a Metaxin 
protein in E. siliculosus may be derived from the host 
cell. Thus, plant and animal Metaxins may be ortholo- 
gous, but functions are likely to have diverged over 
time. Biochemical characterization of the plant SAM 
complex would provide information on the accessory 
proteins of this complex and provide a clearer picture of 
the evolutionary nature of the accessory subunits in this 
complex. 

Intermembrane space - Mitochondrial intermembrane 
space import and Assembly (MIA) and Tiny TIMs 

The intermembrane space contains two sets of proteins 
that are essential for cell viability in yeast. The tiny TIM 



proteins 8, 9, 10 and 13 appear to be present in a wide 
variety of eukaryotes (Figure 5A). They play an essential 
role in the import of carrier proteins into the inner 
membrane and also the assembly of P-barrel proteins 
into the outer membrane [52]. It has been proposed 
that they arose from an ancestral protein present in the 
original host that housed the mitochondrial endosym- 
biont [63]. There are eukaryotes that lack the small 
Tims {Trichomonas vaginalis and Encephalitozoon cuni- 
culi) or only contain one small Tim protein {Cryptospor- 
idium hominis) [64,65], indicating that they are not 
absolutely essential, even though these organisms con- 
tain carrier type proteins on the inner membrane that 
should require these components for import. Thus, the 
lack of small Tims is likely to be a derived situation 
associated with the presence of highly modified mito- 
chondria (i.e. mitosomes) in these organisms. 

The MIA pathway is the most recently described 
import pathway for mitochondrial proteins. It consists 
of two essential proteins in yeast, Mia40 and Ervl, 
which catalyse the oxidative folding of proteins when 
they enter the intermembrane space. Substrates of this 
pathway are proteins that contain conserved cysteine 
residues that undergo oxidative protein folding in the 
intermembrane space. Both Mia40 and Ervl are essen- 
tial proteins in yeast, with Mia40 proposed to act as the 
intermembrane space receptor for proteins [66]. Whilst 
detailed structural and mechanistic analysis has been 
carried out on this system in yeast [67], little is known 
about the components in other organisms. Interestingly 
the apparent lack of a gene encoding Mia40 in trypano- 
somes suggests that this pathway may display variations 
between species [68]. 

For the MIA machinery, orthologues of Mia40 and 
Ervl are present in yeast, humans and plants (Figure 
5A). Although Hotl3 has been reported to be wide- 
spread in eukaryotes, analysis of the proteins identified 
indicates it is -600 amino acids long and most likely a 
transcription factor in green plants. Brown algae contain 
a protein similar to yeast Hotl3 (Figure 5A, Additional 
file 1). However, given the small size of this protein 
with conserved metal domains it is unclear if it is ortho- 
logous to the yeast protein. Although plant Ervl and 
Mia40 are orthologous to their yeast counterparts, the 
primary structure of the protein differs (Figure 5B), sug- 
gesting possible mechanistic differences. Deletion of 
Mia40 in Arabidopsis is not lethal, and in fact normal 
growth and development are observed [69]. Ervl is 
essential in Arabidopsis and analysis of the primary 
sequence indicates that the arrangement of cysteines dif- 
fers to that in yeast (and humans). Arabidopsis Ervl is 
similar to that found in Trypanosoma brucei and a pro- 
tein called Erv2 that is located in the endoplasmic reti- 
culum of yeast (which operates without a Mia40 like 
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Figure 5 Components of the MIA and IMS protein import apparatus of plants. A) A table displaying the components of the small Tim 
proteins and MIA pathway components of plants. B) Schematic diagram of the different Erv1 sequences found in different organisms. The grey 
region represents the most conserved region between different Ervl sequences containing the redox centre, the FAD binding site and two 
conserved cysteine domains (CXXC and C-16-C). The location of the third cysteine pair differs between organisms which seems to be dependent 
on either a Mia40 protein being present or whether that Mia40 is essential or not. Abbreviations are the same as Figures 2 and 4. 
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protein) (Figure 5B). Given that the import of the small 
Tim proteins and carrier proteins is normal in Arabi- 
dopsis plants that lack Mia40 [69], mechanistically Ervl 
can function without Mia40 in oxidative protein folding 
in the intermembrane space. Analysis of the genome of 
D. discoideum indicates that Mia40 is present [68]. As 
Mia40 is absent in Trypanosoma brucei, Encephalitozoon 
cuniculi [65,68], and the brown algae E. siliculosus 
(Figure 5A, Additional file 1), this suggests that the pre- 
sence of Mia40 in plants is a primitive situation, but its 
function(s) differ in various lineages. 

Translocases of the inner membrane (TIMS) 

The inner mitochondrial membrane contains two trans- 
locases, the TIM17:23 complex that is responsible for 
the import of proteins via the general import pathway, 
and the TIM22 complex, that is responsible for the 
import of carrier proteins into the inner membrane. The 
TIM17:23 complex is responsible for the import of pro- 
teins that contain N-terminal targeting signals into or 
across the inner mitochondrial membrane [23]. This 
complex contains 9 components in yeast and several of 
the components are essential in yeast [52]. Tim23 forms 
a presequence and voltage sensitive channel [70], while 
Timl7 plays a crucial role in voltage sensing [71,72]. 
The TIM17:23 complex can be divided into the PAM 
complex, the presequence assisted motor consisting of 
five subunits (Tim44, HSP70, Pam 16, 17 and 18) and 
the membrane components of Timl7, 23, 21 and 50. 
The TIM22 translocase is responsible for the import of 
proteins that contain internal targeting signals and con- 
tain multiple (4 or 6) transmembrane spanning regions 
into the inner membrane [23]. In contrast to the 
TIM17:23 complex, the mechanistic details of how it 
operates are not yet fully understood. However Tim22 
has been shown to have channel activity that is only 
active in the presence of a substrate protein [73]. In 
yeast it contains three accessory proteins, Tim54, 18 
and 12 [52]. However no details on the composition of 
this complex in other organisms have been reported. 

In contrast to the TOM complex on the outer mem- 
brane, eight of the nine components of the Timl7:23 
complex are conserved between yeast, humans and 
plants (the only difference being that yeast contain a 
Paml7 protein not present in humans and plants) 
(Figure 6A, Additional file 1). It has been previously 
proposed that the channel forming subunits of this com- 
plex, Tim23 and Timl7, are derived from amino acid 
transporters in bacteria, specifically LivH, and defined a 
family of proteins termed PReprotein and Amino acid 
Transporters (PRAT) [74]. It seems that originally there 
was one PRAT type protein that subsequently diverged 
to give rise to the three different PRAT proteins typi- 
cally found in mitochondria [75]. In some organisms a 



single PRAT protein exists, which is likely a derived 
condition where the other PRAT proteins have been 
lost [65]. Additionally, not all subunits of the TIM17:23 
complex are observed in all organisms, i.e. the absence 
of Tim50 in D. discoideum [49], suggests that accessory 
subunits can be lost. 

In the case of the TIM22 complex, the translocase 
responsible for the import of metabolite carriers, or 
multiple spanning proteins of the inner membrane, 
including the PRAT proteins themselves, only the 
Tim22 component is conserved. In fact Timl8 and 
Tim54, along with Timl2, are only found in yeast and 
not in other organisms including plants. Thus, the addi- 
tional components of this translocase are yet to be char- 
acterized in other organisms. 

Although the TIMs seem to be better conserved in 
terms of orthology compared to the TOM complex, 
there are notable differences in plants. Firstly, the family 
of PRAT proteins has greatly expanded in plants com- 
pared to yeast and mammals. In Arabidopsis there are 
17 members, rice has greater than 24 members and 
examination of C. reinhardtii and P. patens reveal 5 and 
21 members respectively. Some of these PRAT proteins 
are located in plastids, while others are found in mito- 
chondria [76]. In addition to the greater number of 
PRAT proteins in plant genomes, Timl7 in plants varies 
in size from 133 amino acids to 252 amino acids, the 
difference to yeast Timl7 of 158 amino acids is at the 
C-terminal end of the protein (Additional file 1). 
A C-terminal extension is found on Timl7 proteins 
from C. merolae through to Arabidopsis. It has been 
shown in Arabidopsis that this C-terminal extension is 
exposed on the outer surface of the outer membrane 
[36] and Arabidopsis Timl7 can only complement a 
yeast Timl7 mutant if this extension is removed [36]. 

In order to investigate possible function(s) of the 
Timl7 C-terminal extension in plants we conducted a 
motif search on all the identified Timl7 extensions. 
A distinct motif was detected (Figure 6C). Using this 
motif in blast searches identifies a number of different 
nucleic acid binding proteins (Additional file 2). This 
suggests that Timl7 in plants may be able to bind RNA 
and/or DNA. As plant mitochondria import tRNAs and 
have recently been shown to bind mRNA [22,77], sug- 
gests a possible role for Timl7 in the binding and/or 
import of nucleic acids into plant mitochondria. 

Mitochondrial processing peptidase(s) 

Mitochondria require a number of peptidases to remove 
the targeting signals from proteins before or after they 
are assembled into functional protein complexes. These 
peptidases range from activities that remove the target- 
ing signals, such as mitochondrial processing peptidase 
(MPP) and intermediate processing peptidase (IMP) 
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Figure 6 The TIM17:23 and TIM22 complexes of plants. A) A schematic representation of the TIM1 7:23 complex from yeast and plants. 
B) A table displaying the components of the TIM1 7:23 and TIM22 complexes in all plant species analysed. C) A diagram showing Arabidopsis 
Tim17-2 that contains four transmembrane domains (TM) and an extra C-terminal extension. This C-terminal extension is found in all higher 
plant species and motif analysis of the C-terminal extension identified to highly conserved regions of which both are related to nucleic acid 
binding proteins. Colors and abbreviations are the same as Figures 2 and 4. 
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[23], to the removal of a single amino acid from pro- 
teins that have already had the targeting signal removed 
[78], to the presequence degrading peptidase (PreP) that 
degrades the targeting signals once they have been 
removed [34]. 

Plant mitochondria contain a number of orthologous 
proteins in comparison with the various processing pep- 
tidases of yeast mitochondria (Additional file 1). At a 
sequence level the processing peptidases found in plant 
mitochondria look similar to those of yeast, however, 
functional investigations have revealed a number of dif- 
ferences. One major difference between yeast and plants 
is the location of the mitochondrial processing peptidase 
(MPP). In yeast both a and P MPP subunits are located 
in the matrix, however, in plants it has been demon- 
strated that they are integrated into the cytochrome be! 
complex located in the inner membrane [31,32]. Addi- 
tionally, the presequence degrading peptidase of yeast is 
located in the intermembrane space whereas in plants it 
is located in the matrix [79,80]. Interestingly, the prese- 
quence degrading peptidase of plant mitochondria is 
also dual targeted to plastids where it degrades plastid 
targeting signals [79]. 

In terms of the processing site recognition by MPP in 
plants, it has been reported that the majority of plant 
mitochondrial presequences fall into 2 classes. In class 1 
the processing signal is a -2 Arg residue while in class 2 
presequences the signal is a -3 Arg residue [20,81], simi- 
lar to what has been reported for yeast and mammals 
[23]. However, it has been demonstrated that in fact 
there is a second processing step in yeast [82], a novel 
peptidase called intermediate cleavage peptidase (Icp55) 
was found to process mitochondrial presequences after 
MPP, cleaving only 1 amino acid from the N-terminus, 
turning the proposed -2 cleavage signal into a -3 clea- 
vage signal [78]. It is tempting to speculate that as 
plants contain an orthologue of Icp55 that the same 
cleavage is occurring, however this awaits experimental 
confirmation. Despite the orthology between many plant 
peptidases and those in other organisms it is still neces- 
sary to define their specific functions in plants. It has 
been demonstrated that the plant orthologue of the 
rhomboid protease from yeast does not carry out the 
same processing roles/activities in plants such as Arabi- 
dopsis [83]. 

Conclusions 

The plant mitochondrial import apparatus displays 
many differences compared to other non-plant organ- 
isms and between plant groups. The TOM complex in 
plants displays the most variability in that as many as 
five different TOM complexes exist in plants when red 
algae, brown algae and green plants are considered. 
Even in the green plant lineage variation is observed 



with OM64 only being present in monocot and dicot 
plants. While the composition of the other protein com- 
plexes may appear more conserved, the lack of biochem- 
ical characterization of these complexes in any plant 
group means that the presence of plant specific acces- 
sory subunits in various lineages cannot be judged. 
Additionally for some proteins, such as Mia40 and 
Timl7, functions have expanded in plants compared to 
those characterized in yeast. 

Methods 

The protein sequences for all of the known mitochon- 
drial protein import components from Saccharomyces 
cerevisiae (Tom20, Tom70, Tom71, Tom40, Tom22, 
Tom5, Tom6, Tom7, Sam50, Sam37, Sam35, MdmlO, 
Miml, Mia40, Ervl, Hotl3, Tim9, TimlO, Tim8, Timl3, 
Timl2, Tim22, Tim54, Timl8, Tim23, Timl7, Tim50, 
Tim21, mtHsp70, Mgel, Tim44, Paml8, Mdj2, Paml6, 
Paml7, MPPa, MPPp\ Octl, Impl, Imp2, Soml, Ytal2, 
YtalO, Ymel, Mgrl, Mgr3, Pcpl, Icp55, Oxal, Mbal, 
Coxl8, Pntl, Mss2, Mdjl, Hsp60, HsplO, Hsp78 and 
Ziml7) were downloaded from the NCBI protein data- 
base (http://www.ncbi.nlm.nih.gov/protein/). The 
Metaxin protein sequences were also obtained from 
Homo sapiens. Using the above protein sequences Blastp 
[84] searches of the protein sequences from Physcomi- 
trella patens, Selaginella moellendorffii, Chlamydomonas 
reinhardtii, Arabidopsis thaliana, Oryza sativa, Zea 
mays, Vitis vinifera, Glycine max and Populus tricocarpa 
were performed using the Phytozome (http://www.phyto- 
zome.net) database. Blastp [84] searches of Cyanidioschy- 
zon merolae were performed using the Cyanidioschyzon 
merolae genome project website (http:/ /merolae. biol.s.u- 
tokyo.ac.jp/). Blastp [84] searches of Ectocarpus siliculo- 
sus were performed at the Bioinformatics online genome 
annotation system website (http://bioinformatics.psb. 
ugent.be/webtools/bogas/overview/Ectsi). Blastp [84] 
searches of Chlorella variabilis NC64A genome [38] was 
performed at the Chlorella genome website (http://gen- 
ome.jgi-psf.org/ChlNC64A_l/ChlNC64A_l.home.html). 
To identify mitochondrial import components ofPicea 
glauca tblastn [84] searches were carried on EST 
sequences [43] at the NCBI website (http://blast.ncbi. 
nlm.nih.gov/Blast.cgi). 

All multiple sequence alignments were carried out 
using MAFFT [85] and visualized using Multiple align 
show (http://www.bioinformatics.org/sms/multi_align. 
html). The program IQPNNI [86] was used to recon- 
struct a maximum likelihood phylogeny assuming the 
Whelan and Goldman model [87]. Phylogenetic trees 
were finally visualized using the program Geneious 
(http://www.geneious.com). 

TMpred (http://www.ch.embnet.org/software/TMPRED_ 
form.html), TMHMM (http://www.cbs.dtu.dk/services/ 
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TMHMM/), and DAS (http://www.sbc.su.se/~miklos/ 
DAS/) [88] were used in the prediction of transmem- 
brane regions. TPR repeats were predicted using 
TPRpred (http://toolkit.tuebingen.mpg.de/tprpred) [89]. 
Motif analysis was performed using MEME (http:// 
meme.nbcr.net/meme4_4_0/cgi-bin/meme.cgi) using 
default parameters for all plant like Metaxin sequences 
and sequences of the plant Timl7 extensions [90]. 

Additional material 



Additional file 1: Supplementary table 1. The mitochondrial import 
machinery of plants 

Additional file 2: Supplementary table 2. The top 50 proteins 
identified using the conserved motifs on the C-terminal of plant 
Tim! 7 in a Blastp search 
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